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' Abstract 

W)| We study the structure of satisfying assignments of a random 3-Sat formula. In particular, 

^ ' we show that a random formula of density a > 4.453 almost surely has no non-trivial "core" 

'^1^ assignments. Core assignments are certain partial assignments that can be extended to satisfying 

assignments, and have been studied recently in connection with the Survey Propagation heuristic 
CO I for random Sat. Their existence implies the presence of clusters of solutions, and they have 

been shown to exist with high probability below the satisfiability threshold for fc-SAT with 
fc > 9 [1]. Our result implies that either this does not hold for 3-Sat or the threshold density 
for satisfiability in 3-Sat lies below 4.453. The main technical tool that we use is a novel simple 
application of the first moment method. 
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1 Introduction 

The study of random instances of 3-Sat has been a major research focus in recent years, both 
f"^ I because of its inherent interest and because it is a natural test case for the wider understanding of 

OO ■ the complexity of computational tasks on random inputs. In random 3-Sat the input is a formula 

drawn uniformly at random from all formulas of fixed density a, i.e., formulas with an clauses on n 
variables. Friedgut fl\ proved that there exists a function ac{n), known as the satisfiability threshold, 
' such that for any positive e, random formulas of density adn) — e have satisfying assignments with 

' high probability, and random formulas of density adn) + e have no satisfying assignment with 

high probability. It is conjectured that ac{n) is a constant (and that its value is about 4.27), but 
I currently all that is known is that, for large n, 3.520 < adn) < 4.506 [TTl \8\\6\. 

^ • In the range of densities for which the formula is satisfiable with high probability, the inter- 

esting algorithmic question is whether we can find even one of the many satisfying assignments in 
polynomial time. The lower bound on adn) is a result of the analysis of such a polynomial time 
algorithm [ll^ [8] . This algorithm belongs to a family of algorithms known as "myopic" because 
they assign variables greedily one by one in an order that is based only on the number of positive 
and negative occurrences of each variable. 

An apparently much more powerful algorithm is Survey Propagation |17lll8j . In experiments on 
very large instances (say, with n = 10^ variables) it finds solutions for formulas of densities only just 
below the conjectured threshold value a = 4.27; however, a rigorous analysis of its performance is 
still far from our reach. Like the myopic algorithms it also assigns variables one by one in a greedy 
manner, but its choices are based on more global information about the role of each variable in 
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the formula. That information is provided by the fixed point of a sophisticated message passing 
dynamics between variables and clauses. 

The message passing procedure is based on an intriguing (informal) picture of the properties 
of the solution space of a random formula, which is derived from the 1-step Replica Symmetry 
Breaking ansatz of statistical physics [16]. A key postulate of this physical picture is that, for 
formulas of density higher than a certain value (estimated to be 3.92), the space of solutions is split 
into "clusters." Within the same cluster it is possible to reach any satisfying assignment from any 
other by flipping one variable at a time, while always keeping the formula satisfied. On the other 
hand, in order to get from a solution in one cluster to a solution in another, the values of a linear 
number of variables have to be flipped at the same time. Loosely speaking, the message passing 
procedure of Survey Propagation was proposed as a way to collect information about the clusters 
of assignments, rather than individual assignments. 

The remarkable performance of Survey Propagation is a compelling reason to explore properties 
of the solution space of typical formulas as a way to further our understanding of random 3-Sat, 
its hardness, and the satisfiability threshold, and also (looking much further ahead) in order to 
systematically design algorithms for more general problems with distributional inputs. 

A detailed study of the Survey Propagation algorithm undertaken in [14] and [3] led to an inter- 
pretation of the message passing procedure as the more familiar Belief Propagation algorithm [19] 
applied to a particular probability distribution on partial assignments, i.e., assignments of values 
from the set {0, 1, *}. Here * is to be interpreted as "unassigned," and a variable is allowed to be 
unassigned only if it is not forced to be assigned or 1 in order to satisfy a clause. 

Among these partial assignments, the set of "core" assignments plays a central role. These are 
partial assignments that are obtained from a satisfying assignment by successively replacing each 
unconstrained variable by *. (A variable is unconstrained if changing its value does not make any 
clause unsatisfied.) Any satisfying assignment has a unique corresponding core. Moreover, since 
all assignments in a cluster have the same core, cores can be viewed (informally) as "summaries" 
of clusters. Of course, this view is useful only if different clusters tend to have distinct non-trivial 
cores. (The trivial core assignment is the one without any assigned variables.) Recently, Achlioptas 
and Ricci-Tersenghi [1] showed that, in random A;-Sat for k >9, for some range of densities up to 
the satisfiability threshold, with high probability every satisfying assignment has a non-trivial core 
assignment associated with it. This implies that clusters have a large number of frozen variables; 
indeed, for large k the fraction of frozen variables comes arbitrarily close to 1. (The clustering 
picture has also been confirmed for 8-Sat by [15] and [1] using a different method that does not 
say anything about cores.) 

The above results hold only for random /c-Sat with /c > 8 or 9. In this paper we investigate 
similar questions for the apparently harder case of random 3-Sat. Our main result is the following: 

Theorem 1 For random instances of ^-Sat with density greater than 4.453, with high probability 
there exist no non-trivial core assignments. 

This theorem requires some interpretation. Note first that the density 4.453 lies above the 
conjectured threshold value of 4.27 but below the current best known upper bound of 4.506. Thus 
we may deduce: 

Corollary 2 One of the following statements holds for random 5-8 AT; 
• o:c{n) < 4.453; or 
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• there is a range of densities immediately below the satisfiability threshold for which with high 
probability there are no non-trivial core assignments. 

One interpretation of Theorem [1] is as evidence for an improved upper bound on the threshold of 
the form adn) < 4.453. On the other hand, if in fact adn) > 4.453 then the theorem estabUshes 
a range of densities immediately below the threshold for which with high probability the formula 
is satisfiable but has no non-trivial core assignments. This would represent a surprising difference 
between the properties of random 9-Sat and random 3-Sat. Interestingly, experiments with 3-Sat 
and solutions of large formulas found by Survey Propagation do not find cores (see [BJ ) . It is quite 
conceivable that both of the statements in the above corollary hold. 

We now briefly discuss our proof technique, which involves a novel application of the first mo- 
ment method and is, we believe, perhaps more noteworthy than the result itself. A straightforward 
application of the first moment method to the set of core assignments only allows us to show that 
with high probability there are no cores of small size or of large size. To handle core assignments 
of intermediate size, it is further necessary to bound the probability that a partial assignment can 
be extended to a satisfying assignment. This probability is equivalent to the probability that a 
random formula with a given density of 2-clauses and 3-clauses is satisfiable. For this sub-problem, 
it is natural to use one of the methods that have been previously introduced for bounding the prob- 
ability of satisfiability of random 3-Sat formulas [101 [Sj [T3l El [T2l [6j. However, the most powerful 
method, from [6], is very heavy numerically, and it does not seem possible to carry it out for the 
whole range of required densities; on the other hand, simpler methods such as those in [13] are 
apparently not powerful enough. Instead we introduce a new method, which we now briefly outline 
(for a detailed description, see Section [3?2]) . 

Traditionally, bounds on the probability of satisfiability such as those mentioned above are 
obtained by applying the first moment method to a random variable which counts the number of 
satisfying assignments of a particular kind. Indeed, much of the work on bounding the satisfiability 
threshold has been directed towards identifying a set of satisfying assignments that is a strict subset 
of the set of all satisfying assignments (so that its expected size is much smaller), but is always 
non-empty if the formula is satisfiable. The novelty in our approach lies in identifying a new random 
variable which depends not only on satisfying assignments but also on partial assignments. This 
random variable is at least 1 for every satisfiable formula, it exploits the clustering structure of the 
solution space, and most importantly, it is significantly easier to compute than many alternative 
approaches. 

Finally we note that the proof of our theorem depends on a claim that a particular analytic 
function takes only negative values in a given range. We do not provide a complete proof of this 
numerical claim, but we outline the steps needed for completing the proof, and give very strong 
numerical evidence that all the corresponding statements hold. The difficulty with making the 
proof completely rigorous is that this would require too much computational effort, which we feel is 
not justified at this stage given that the bound we obtain is still some distance from the satisfiability 
threshold. The results presented here should rather be considered as a proof of concept. 

The remainder of this paper is structured as follows. In Section 2 we give necessary background 
and precise definitions of the various concepts used in the paper. Section 3 is devoted to the proof 
of our main result. Theorem [TJ We conclude in Section 4 with some final remarks and suggestions 
for future work. 
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2 Technical definitions 



Let set of n Boolean variables. A literal is either a variable or its negation. A 3-Sat 

formula is a formula in CNF, where each clause is a disjunction of 3 distinct literals (on different 
variables). For every clause c we will denote the set of variables that appear in the clause by V{c). 
The distribution that we consider is the following: for a given density a we choose uniformly at 
random and with replacement m = [an\ clauses out of all possible clauses on 3 distinct variables. 
For a clause c and a variable Xi E V{c) we denote by Sc,i and iic,j the value for variable Xi that is 
respectively satisfying and unsatisfying for clause c. 

Suppose that the variables x = (xi, . . . , x„) are allowed to take values in {0, 1, *}, which we refer 
to as a partial assignment. A variable taking value * (star) should be thought of as unassigned. 

Definition 1 A partial assignment to x is invalid for a clause c if either (a) all variables are 
unsatisfying; or (b) all variables are unsatisfying except for one index j S V{c), for which xj = *. 
Otherwise, the partial assignment is valid for clause c. We say that a partial assignment is valid 
for a formula (or just "valid") if it is valid for all of its clauses. 

For a valid partial assignment, the subset of variables that are assigned either or 1 values can 
be divided into constrained and unconstrained variables in the following way: 

Definition 2 We say that a variable Xi is the unique satisfying variable for a clause c if it is 
assigned Sc,j whereas all other variables in the clause (i.e., the variables {xj : j S V{c)\{i}}) are 
assigned Ucj- A variable Xi is constrained by clause c if it is the unique satisfying variable for c. 

A variable is unconstrained if it has or 1 value, and is not constrained by any clause. Thus for any 
partial assignment a £ {0, 1, *}" the variables are divided into stars, constrained and unconstrained 
variables. Let S*{a) be the set of unassigned variables, and n^{a) and no(cr) denote respectively 
the number of stars and the number of unconstrained variables. We define the weight of a valid 
partial assignment to be 

^(a) :=p"-('^)(l-p)"°('^), (1) 

where p is a parameter in the interval [0, 1]. The weight of an invalid partial assignment is 0. 

In [T3] the Survey Propagation algorithm is interpreted as a special case of a larger family of 
Belief Propagation algorithms applied to a family of distributions on valid partial assignments. This 
family of distributions, parameterized by p, is defined as Pr[cr] oc W{a). At one extreme, p = 0, 
this becomes just the uniform distribution over (full) satisfying assignments. The other extreme, 
p = 1, corresponds to Survey Propagation. The pure version of Survey Propagation corresponds to 
setting p = 1. Intermediate values of p interpolate between these extremes. 

Next we define a natural partial order (represented by an acyclic directed graph) on valid partial 
assignments. The vertex set of the directed graph G consists of all valid partial assignments. The 
edge set is defined in the following way: for a given pair of valid partial assignments a and r, the 
graph includes a directed edge from a to r if there exists an index i £ {1, . . . ,n} such that (i) 
aj = Tj for all j i; and (ii) Ti = * and ai ^ *. 

Valid partial assignments can be separated into levels based on their number of star variables, 
i.e., the assignment a is in level n^:{a). Thus every edge goes from an assignment in level / — 1 to 
one in level /, where 1 < I < n. G is acyclic and we write r < cj if there is a directed path in G 
from cr to r. In this case we will also say that the assignment r is consistent with the assignment 
a. The outgoing edges of any valid partial assignment a correspond to its unconstrained variables, 
and therefore its outdegree is equal to no((T). The minimal assignments in this ordering are the 
assignments without unconstrained variables, i.e., the positive weight assignments for p = 1. 
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Definition 3 A core of a satisfying assignment cr is a minimal assignment r such that r < cr. 

The following proposition about cores is proved in [14] . 

Proposition 3 /i^/ Any satisfying assignment a has a unique core. Furthermore, if satisfying 
assignments a^,a'^ G {0, 1}" belong to the same cluster of solutions then they have the same core. 

In the above proposition a cluster is simply a connected component of the graph on solutions, 
in which two solutions are connected by an edge if and only if they are at Hamming distance 1. 

Definition 4 A cover is a valid partial assignment that contains no unconstrained variables. 

In particular, the core of any satisfying assignment is a cover. On the other hand not all cover 
assignments are cores (because they may not be extendable to satisfying assignments). We say that 
a cover assignment r is non-trivial if n^:{T) < n, so that it has at least one assigned variable. 

The proof of Theorem [T] uses a surprising property of the weights ([T]) , which was observed 
in [H] but was not utilized there. (A more general statement and a connection to a combinatorial 
object known as "convex geometry" was developed in [2].) Specifically, the total weight of partial 
assignments consistent with a given satisfying assignment is exactly 1. This fact implies that the 
probability of satisfiability is at most the expected total weight of partial assignments. 

Theorem 4 mJ For every p G [0,1], ET<a l^(^) = P"*^'^'' f"^ 

any valid partial assignment a G 
{0, 1,*}". In particular, if a is a satisfying assignment then X]r<(j^(''") ~ ^■ 

In our proof, we will apply Theorem [4] with various values for p G (0.8, 1). In each application 
we will chose the value of p to get the best bound possible for the probability that a formula chosen 
from some distribution has a satisfying assignment. In particular, if a satisfying assignment exists, 
the total weight of valid partial assignments is at least 1. Therefore, the probability of satisfiability 
is at most as large as the expected value of the total weight of partial assignments. Applying 
this idea directly to the original distribution on 3-Sat formulas leads to an upper bound on the 
threshold Oc that is weaker than the currently best known bound of 4.506. However, the derivation 
is simpler than other approaches, which makes it possible to apply the same method to bound 
the probability that a fixed valid partial assignment can be extended to a satisfying assignment. 
(This amounts to applying the method to random formulas coming from a variety of distributions 
on formulas with both 2-clauses and 3-clauses.) This allows us to estimate the probability of the 
existence of a non-trivial core, and thus to prove the theorem. 

3 Proof of the main theorem 

This section contains a proof of Theorem [TJ We begin with an overview of the entire proof, in the 
course of which we will state various technical lemmas; these lemmas will be proved in the three 
subsections that follow. 

Note first that for a > 4.506 the statement of the main theorem follows from the fact that 
random 3-Sat formulas of density at least 4.506 are known to be unsatisfiable with high proba- 
bility [6]. Hence from now on we focus on the case that a G [4.453,4.506]. Our goal is to prove 
that, for densities above 4.453, with high probability there are no non-trivial covers that can be 
extended to satisfying assignments, i.e., there are no non-trivial cores. 

To this end, define the size of a cover (or of a core) to be the number of variables assigned value 
or 1. The following lemma, proved in [T3], establishes that with high probability all non-trivial 
covers (and consequently cores) are of linear size. 
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Lemma 5 Ji^j / For a random 3-Sat formula of density a, with high probability there are no non- 
trivial covers of size strictly less than -^n. 

This lemma implies that it is sufficient to consider core assignments of size an for a G [l/(ae^), 1]. 
Let Xa denote the number of cover assignments of size \an\ , and Ya denote the number of core 
assignments of size [onj . 

In addition to the density a, the size measure a, and the weight parameter p £ [0,1] from 
Theorem U we will use two further parameters, d and b, whose precise definitions will be given 
in the proofs in the following subsections. Roughly speaking, d is the fraction of clauses that 
are constraining with respect to a given partial assignment, and b is the fraction of constrained 
variables with respect to a given partial assignment. The ranges of these parameters are d £ {a/a, 1] 
and b £ [0, 1 — a] . We will also make use of a derived parameter r which is defined by d, a and 
a. It appears in connection to the event that the constraining clauses succeed in constraining all 
constrained variables. Specifically, r is the value satisfying the equation d = ^In^:^. (Note that 
such a value r always exists and is unique as the right-hand side is monotonic in r.) In fact, because 
of the form of this expression, it will be more convenient to think of d as being determined by r, 
a, and a. Whenever we take the supremum over one of these parameters, we always mean the 
supremum over its allowed range. 

We now define two functions that play a central role in our analysis: 

f{a,a,r) := aln{2) + H (a) + aH{d) + adln I— j + a{l - d) In I 1 

+ adln{r/e) — aln(r — 1); 



h{a,a,r,p,b) := 61n(2) + (1 - a - 6) ln(p) + (1 - a)i?(6/(l - a)) 

„, 6(6 - 56 - 15a) + 12(1 - a)a / ~3a(i-d) "C'+f'') 

— ry h — d)h — - - I- 6 n \ ^ — ne 2 4-a2 3-a ) 

a)t) 2(4-a2(3-a)) +^^^^(^1 pe 

Here H denotes the entropy function H{x) = — xln(x) — (1 — x) ln(l — x). 

The first ingredient in the proof of Theorem [1] is the following lemma, whose proof is presented 
in Section 13.11 

Lemma 6 For a random 3-Sat formula of density a > 1, and for every a € [0, 1], 

lim — In (E[Xa]) < sup /(a, o, r). 

n^oo n r 

A simple application of Markov's inequality immediately yields: 

Corollary 7 If a > 1 and a G [0, 1] are such that, for every r > 1 with d = ^In < 1, it holds 
that f{a,a,r) < 0, then with high probability random 5-Sat formulas of density a do not have 
covers of size an. 

The second ingredient in the proof of Theorem [1] is the following lemma, which is proved in 
Section 13.21 

Lemma 8 For a random 3-Sat formula of density a > 1, and for every a G [0, 1] and p G [0, 1), 
lim — In (E[Ya]) < sup(/(a, a, r) + min{0, sup /i(a, a, r, /j, 6)}). 

n^oo n r b 



6 



Remark: This lemma can be strengthened by allowing p to depend on r; however, we will not 
need to use this stronger version. 

Since every core is a cover we know that Ya < Xa ■ Hence (in light of Lemma [6|) Lemma [8] is 
interesting only when supf, /i(a, a, r, p, 6) is negative. In fact, we shall prove Lemma [8] by showing 
that this supremum bounds the logarithm of 1/n times the probability that a particular cover of 
size a can be extended to a satisfying assignment. (For a precise statement, see Lemma [TT] in 
Section [3.2i ) An immediate corollary of Lemma [8] is the following. 

Corollary 9 If a > 1, a £ [0, 1] and there exists p G [0, 1) such that for every r > 1 with 
d = ^ In < 1, and for every b £ [0,1 — a], it holds that f{a, a, r) + h{a, a, r, p, b) < 0, then with 
high probability random 5-Sat formulas of density a do not have cores of size an. 

The final ingredient in the proof of Theorem [1] is the following numerical claim, whose proof we 
discuss in Section 

Claim 10 For every a e [4.453,4.506], a G [1/(4.5066^), 1], and r > 1 with d = In < 1, it 
holds that either f{a, a,r) < or for every b £ [0,1 — a], f{a, a, r) + h[a, a, r, 0.4a + 0.7, b) < 0. 

Remark: In Claim[Tn]we have used the weight p = p{a) = 0.4a + 0.7. We arrived at this particular 
choice of p by first optimizing numerically for p at 100 values for a G [1/4.506, 1], and then fitting 
a simple function to the values for p that were found. Since Lemma [8] holds for every value of p, 
we may choose any convenient function. Using a simple analytic function guarantees that h is also 
analytic, which makes the numerical analysis of h easier. 

Finally, combining Claim [10] with Corollary [9] completes the proof of Theorem [TJ 

3.1 The expected number of covers: Proof of Lemma [6] 

Let s = [an\ . Then we have 

s) n {no{a) = 0)] 
0)] 

constrained] . 

We denote by P the probability that a = (0^ *^~^) is valid and all of its assigned variables 
are constrained. P is equivalent to the probability of the following event in an experiment with 
m = an balls thrown uniformly and independently at random into 2^(g) bins. There are 3 kinds 
of bins: 

1. Bins of type 1 should be empty. These correspond to clauses that are not allowed: 

• (xii Vxi2 Vxjg), with ii,i2,^3 £ {1,2, ...,s}; 

• (xj^ V V Xj), with ii,i2 G {1, 2, . . . , s} and j > s; 

• (xj^ V V Xj), with ii, ^2 G {1, 2, . . . , s} and j > s. 



E[Xa] = E 



n 



Ind [a is valid n (n* (cr 

cre{0,l,*}" 

Pr[(7 is valid n (no(o") 



(T£{0,l,*}":n, (a-)=n—s 



2" Pr[a = (0^ is valid and xi, . 
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The total number of these is 



J j + 2(n - .) Q =n' [j + a\l - a) j + o{n'). 

2. Bins of type 2 correspond to constraining clauses: (xjj Vxjj VXf), with zi, 22, t S {1, 2, . . . , s}. 
For each variable xj there are ('*2^) = \ + o{n?') clauses that could constrain it and at least 
one has to be included. Equivalently, there has to be at least one ball in one of those bins for 
every xt with t G {1, 2, . . . , s}. The total number of these clauses is: s(^2 ) = \ + o(n^). 



^3 



3. There are no constraints for the remaining bins, of type 3. Their total number is 

t + a\l-a)]-n^ - 
.3/ V 6 ^ V 2 



23(") (^ + a\l-a)]-n^- + o{n^) = — {A-a\2,-a))+o{n^). 



Suppose m! = dm of the clauses we choose are of type 2, and the remaining m — m! are of 
type 3. The probability of this event is 

P.'=y(^ + o(l)j [1 V^+o(l) 

The probability that the m! clauses of type 2 are such that there is at least one of each kind is 
the same as the coupon collectors probability of success, with s = [an\ different coupons, and 
m! = {da)n trials. We will use the following general fact from [4], which was previously used 
in a very similar context in [12]: Let q{cN,N) denote the probability of collecting coupons 
within cN trials. If c < 1, q{cN,N) = 0. Otherwise, as goes to infinity q{cN,N) grows 
like g{c)^ , where g{c) = and ro is the solution of rln^^:^^ = c. More precisely, 

limAT^^oo jr In {q{cN, N)) = ln{g{c)). We have 



m'=0 ^ ^ ^ 

.3 \"^' / „2 



q{m', s) 



Finally, 



E[Xa] < Q 2^minax | (-) + o(l))™' (l - + o{l))"' q{m' , .)} , 



and hence 



lim -ln(E[Xj) < Inf— _^ 



+ sup < 

d 



a In 

I V 



sup /(a, a, r) = sup /(a, a, r) 

d r 



This completes the proof of Lemma [UJ 
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3.2 The probability that a cover assignment is a core: Proof of Lemma [8] 

For a partial assignment a, it will be convenient to denote by ^Jo- (2^5* (o-) ) the formula that is obtained 
by substituting the variables that have 0/1 assignments in a, i.e., removing from ip clauses that 
are satisfied by at least one of the assigned variables and removing all remaining appearances of 
assigned variables. This is a formula on n*{a) variables. Notice that if a is a valid assignment 
for (p then the formula ipa contains no empty clauses and no unit clauses. Furthermore all clauses 
of type 2 (from the previous section) are removed, because they are satisfied by their constrained 
variables. Among the clauses of type 3, there are clauses that are removed, there are clauses that 
become two- variable clauses, and there are clauses that remain untouched. Since there is no simple 
way to describe the resulting distribution on formulas, we will keep referring to the set of all clauses 
of type 3, even the ones that are removed in ip(j. When we condition on the fact that o" is a cover 
and m! of the clauses are of type 2, as in the previous section, we know that the set of clauses we 
are interested in are distributed exactly as a uniform set of (m — m!) clauses of type 3. Thus we 
can express the expected number of cores as 



E[y(j] = Pr[(T is a cover] x Pr[(T is a core | o" is a cover 

cr£{0,l,*}" :n, (cF)=n—s 



n 



2* Pr[cT = (0* *" ^) is a cover] x Pr[a = (0* *" is a core | cr is a cover] 
2'' X ^ Pm' q{m',s) 

m'=s 

. . . , Xn) is satisfiable | cr — (0^ ni — m! clauses are of type 3]. 

We will bound this probability using the Poisson approximation, which is a standard technique 
in this area. There are two related random models. In the first model, which we call the exact 
model, {m — m') clauses are chosen uniformly at random with replacement from all M = n^(4 — 
a^(3 — a))/3 + o(n^) clauses of type 3. In the second model, which we call the Poisson model, each 
of the M clauses is included in the formula with probability p = {m — m')/M — l/(n^ylogn) = 
n^{i^a:^(^\a)) ~^ ^ (n^')' "^^^ expected number of clauses in both models is the same up to a term 
5 = M/(n^ -y/log n) = 0(n/-y/logn) which is sub-linear in n. 

The Poisson model has been studied before in the context of random 3-Sat. It can be shown, 
as the example below demonstrates, that whenever a property holds with high probability in the 
exact model, it also holds with high probability in the corresponding Poisson model. Applying first 
moment techniques to the Poisson model is usually easier, because the clauses are independently 
chosen; however the bounds obtained are usually weaker. 

Next, we relate the probability that ^p^j is satisfiable under the two models. Let Pr^ denote 
probability in the Poisson model, and Prg denote probability in the exact model. Let the random 
variable J denote the number of distinct clauses included in the formula. Then 

M 

Prp[(^o- is satisfiable] = Prpi-/ = i] x Pr[(^o- is satisfiable \ J = i] 

1=0 

m—m' — 5 

> Pipi"/ = i] x Pr[v3o- is satisfiable | J = i]. 

i=0 
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Since this conditional probability decreases as i increases, and Prp[J < Ep[J]] > 1/2, where Ep[J] = 
m — m' — 5, we have 

Pip[iPij is satisfiable] > Pr[(/9o- is satisfiable | J = m — m' 

> Pr[99o- is satisfiable | J = m — m' 
On the other hand, for the exact model 

Pre [(/Jo- is satisfiable] = 

< 
< 
< 

< 



Thus if the probability of satisfiability in the Poisson model is bounded above by c~" for some 
constant c, then lim„^oo :^ In (Pi'ei is satisfiable]) < c. Therefore, it suffices to get a bound for 
the Poisson model. Any of the first moment techniques for bounding the satisfiability threshold of 3- 
Sat can be adapted to bound the probability that ipa- is satisfiable. Of the ones that are technically 
applicable (i.e., result in an explicit analytic expression for every setting of the parameters s and 
m'), we obtain the strongest result using the novel approach of applying the first moment method 
to the distribution on partial assignments defined by the weights in equation ([T|). 

Lemma 11 In the Poisson model with parameters n, m = an, ml = dm, s = \an\ and for 
a = (0'^ *"^'^), and r such that d = ^ In 

lim — In (Prp[ • • • , x„) is satisfiable]) < inf sup h{a,a,r, p,b). 

PG[0,1) fee[o,l_a] 

Proof: We will apply Theorem [3] to the formula <pij{xs+i, ■ ■ ■ , Xn). The theorem implies that if ipu 
has a satisfying assignment then X^reV ^(''") — ^ where W{t) = p'^*^'^\l — y9)"°('^) and the sum is 
over the set V of all partial assignments r G {0, 1, *}'^~^ that are valid for (p^. Thus the probability 
of satisfiability is bounded from above by the expected value of X^^-gy W{t). This holds for any 
value of p e [0, 1]. We bound this expectation. 

For any t G {0, 1, . . . ,n — s} let Zt denote the sum of the weights of valid assignments r for 
ifa{xs+i, ■ ■ ■ ,Xn) such that n*(r) = n — s — t. Then 

n—s 

E\Z] = V ElZf] < n max E\Zt\. 



6]x Pr[J = i] 



i=0 



Pre[J = i] X Pr[(^o- is satisfiable \ J = i] 

1=0 

(m—m'—S \ 
Pre [J = + Pr[</9cr is satisfiable | J = m — m' — 6] 
i=o J 
Pre [at least 6 clauses repeated] + 2Fip[ipcr is satisfiable] 



m — m \ I m — m 



6 J \ M 

[m — my ^ 



+ 2 Prp \ip„ is satisfiable] 



1+2 Prp [(fcr is satisfiable] 
^(2-"v^) + 2Prp[v9o is satisfiable]. 
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E[Zt] = Yl - pr XI ^^^^^ ^ (n*(r) = n - s - i) n (no(r) = u) 

u=0 t6{0,1,*}"-s 

w=0 Te{0,l,*}"-= : n,(T)=n-s-t 



Prp[r = (0* *"'~'^~*) is valid and Xs+i, ■ ■ ■ ,Xs+u arc unconstrained] 



Next we derive the probability that the assignment {xg+i, Xs-\-2, ■ ■ ■ ,Xn) = (0* *"^*^*) is valid 
and the first u variables are unconstrained. Recall that tpa is obtained from if by setting its first s 
variables according to a. Only clauses of type 3 influence tpa and according to the Poisson model, 
each of them is included independently with probability p. The probability that the assignment 
{xs+i, Xs-\-2, ■ ■ ■ ,Xn) = (0* *"^*^*) is valid and the first u variables are unconstrained is equivalent 
to the probability that among the included clauses of type 3, there are no clauses of the following 
kinds: 

• (xjj V V Xjg), with ii G {1,2, . . . , s + t}, 12,13 £ {s + 1, s + 2, . . . , s + t}, 

• (xj^ V V Xj), with ii & {1,2, . . . , s + t}, i2 & {s + 1, s + 2, . . . , s + t} and j > s + t, 

• (xjj V Xjj V Xj), with ii E {1,2, . . . , s + t}, 12 {s + 1, s + 2, . . . , s + t} and j > s + t, 

• (xj^ VxjjVxj), with ii G {1,2, s + t}, 12 G {s + l,s + 2, . . . ,s + t} and j G {s + 1, . . . , s + u}, 

and for every j G {s + u + 1, . . . , s + t}, the formula contains a clause {xi^ V Xi^ V xj), with 
ii G {1, 2, . . . , s + t}, i2 G {s + 1, . . . , s + t}. 

In the Poisson model all clauses are independent, so it is easy to put these events together to 
obtain: 

Q = Prp[r = (0* is valid and Xg+i, ■ ■ ■ ,Xs-\-u are unconstrained] 

= (1 - p)iQ+<l)) + 2(-^-*)(G) + St) + u{Q+st) X (1 - (1 - p)Q+^*y"" 

The expression for the expectation can be simplified: 

^ ^ u=0 

= p"-^-* ~ 2* (1 -p)(3) + <2) + 2(n-.-t)((^) + St) 

Y^f^\ (i-p)«(G)+^*)x (i-(i-p)(2)+^*)' 

2* (l-p)il) + ^2) + 2(n-.-t)(G) + St) 

((1-P)(1-P)(2K^*+(1-(1-P)(2H*))* 

^n-.-t l^"""^*^ 2* (l_p)(2)(6n-M-3.-2)/3+2(n-.-0.t _ p(l _ Q+^*y 



t—u 



pU-s-t 



X 

n=0 
n — S 

X 
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Let t = bn, and recall that s = [an\ , p = n'^4-a^(3-a)) + ° i^)- Then 

V 1, ,r.r^n 1 fil-ay-" 2^ p^-''-''\ a {1 - d) b {b{6 - 5b - 3a) + 12{1 - a - b)a) 
lim — m(iij[ZtJj = m ' ' 



n^oon ^ ^ V - a - ^)^"""'' y 2(4-a2(3-a)) 

( 3a(l-d)b{b+2a) 
1- pe 2{4-a2(3-a)) 

= h{a,a,r, p,b). 

Finally, 

lim — In (Prp[ • • • , a^n) is satisfiablel) < lim — ln(E[Z])< sup h{a,a,r, p,b), 

n^oo n n^oo n 6e[0,l-a] 

which is the statement of Lemma [TTJ ■ 
Substituting the bound of Lemma [TT] into the expression for the expectation yields 

lim — In (E[ya]) < sup(/(a, a, r) + min{0, sup h{a, a, r, p{a, r), b)}), 

n^oo n r b 

completing the proof of Lemma [HI 

3.3 Numerical analysis: Steps to the proof of Claim 1101 

What remains is to verify the numerical claim that for every a G [4.453,4.506], a G [1/(4. 506e^), 1], 
and r > 1 with d = 7^ In < 1, it holds that either f{a,a,r) < or for every b £ [0, 1 — a], 
/(a, o, r) + h{a, a, r, 0.4a + 0.7, b) < 0. We outline the steps towards a proof. 

There are two stages. First, we identify the (a, r) pairs such that for every a G [4.453,4.506] it 
holds that /(a, a, r) < 0. Second, for the remaining range of values of (a, r) we show that for every 
6 G [0, 1 - a], f{a, a, r) + /i(a, a, r, 0.4o + 0.7, b) < 0. 

The derivative of / with respect to r is 



* V V r-lj r-lj \^2{i,-3 + o)ln(^; 

Since the first factor is always negative the derivative is when 

3« = 1"(73t) (^-6 + 2a + 3ar). (2) 

This equation has a unique root for every a £ [l/(4.506e^), 1], a G [4.453,4.506] (the derivative of 
the right-hand side is negative). Therefore we can conclude that for every a and a in the above 
range, / is at first increasing with r then decreasing. Its maximum is achieved at the root of 
equation ([2]). Figure [1] shows the values for r where / is maximized, with respect to a, when 
a = 4.453. 

Since the root of ^ is monotone decreasing with respect to a and a, we can find values ri 
and r2 such that for every a and a in the range, if r < ri then df/dr > 0, and if r > r2 then 
df/dr < 0. Values satisfying this condition are ri = 1.2 and r2 = 670. Thus if we show that 
f{a, a, r) is negative for r = ri and r = r2 then it is negative for any r outside the range (1.2, 670). 
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Figure 1: The pairs (a, r) such that /(4.453,a, r) > 
f {4.4:53, a, r) for every a. 



-0.0001, and the value of r that maximizes 



We will use the shorthand notation q := rln^r^. For r £ (1.2,670) we have q € (1.0007,2.16). 
The whole range satisfies the condition that d = — r In < 1. 

^ a r—1 — 

Next we take care of the boundary region with respect to a. Notice that the derivative of / is 
negative for a large enough (close to 1), because of the entropy term in /. The derivative of / with 
respect to a is 



— = ln(2)-ln- gin- 

da I — a 



+ 3q 



3a(a — aq){2 — a) 



2g(4-a2(3-a)) ^ 4-a2(3-a) 
The following observations are helpful for bounding this derivative: 



+ gln(r) — ln(r — 1). 



• 3q + qln(r) — ln(r — 1) is maximized in the interval r G [1.2, 670] at r = 1.2. 

• 0^(0 — aq)/{4 — a2(3 — a)) is an increasing function of a. 

• a{a - aq){2 - a)/{4 - a^{3 - a)) is an increasing function of a. 

For a > 0.999, using the above facts, the derivative is negative. Thus if we show that / is 
negative for a = 0.999, r G (1.2,670), a S [4.453,4.506] then / is negative also for every a > 0.999. 

We are left with the region a G [1/(4.5066^), 0.999], r G (1.2,670). In this region all the 
derivatives of / can be bounded, and a sufficiently fine grid can be chosen over which to evaluate 
/ in order to identify the grid sections where the function can take positive values. In particular, 
the derivative with respect to a is at most 28.2, and with respect to a it is at most 1. Furthermore, 
since we know that for every a and a, f is maximized as a function of r at the root of equation ([2]), 
one can find this maximum and the range of r where f{a, a, r) is positive using binary search. The 
points with /(4.453, a,r) > —0.0001 are depicted in Figure [H 

In the second stage we need to analyze f{a,a,r) + h{a,a,r,0.4a + 0.7,6) for the remaining 
region of values for (a, r). 
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First, notice that we can take care of the boundary regions with respect to b by taking advantage 
of the entropy term, which has very large slope for b close to 0, and very steep negative slope for b 
close to 1 — a. Specifically, the derivative with respect to b is 

dh , , ^ , . . , / b \ , ,,126 - 1562 - 30a6 + 12a - 120^ 
— = ln(2)-ln(p)-ln \ - a{\ - d)- 



db ' ' ''^^ \l-a-bj ' ' 2(4-a2(3-a)) 

+ ln(l-/,e-^)+3a(l-d)6 Pe-^{a + b) 



(l-pe-^)(4-a2(3-a)) 

where A - Mi~d)b(b+2a) 
wnere a - 2{A-a^3-a)) ■ 

Using the bounds on all parameters: a G [4.453,4.506], a G (0.28,0.7), r G (1.5,14.3), and the 

ones that follow from those: d G (0.06,0.26), A G [0, 1.59], p G (0.02,0.21), we can conclude that 

-3.04 - In ( 1 < < 5.38 - In 



1 — a — 6y db ' \1 — a — 

The lower bound can be made positive by setting 6/(1 — a — 6) < 0.04 and the upper bound can 
be made negative by setting 6/(1 — a — 6) > 220. Therefore it suffices to show that h is negative 
for 6 in the range [0.01, 0.996(1 - a)]. 

Again, using the bounds for the parameters, all the derivatives of f + h can be bounded, and a 
sufficiently fine grid can be chosen over which to evaluate f + h. We did not perform the evaluation 
on a grid that is as fine as is required for the rigorous proof because, based on the current bounds on 
the derivatives, we would need to evaluate the function at more than lO^'' points. (As we discussed 
in the introduction, we feel that the computational effort is not yet justified.) 

For illustration. Figure [2] shows the estimated values of sup^ /(a, a, r) and of sup^ ;,(/(a, a, r) + 
h{a,a,r, p(a),b)) for a = 4.453, 4.470, 4.490, and 4.506. The maximum is taken over evaluations 
at a grid of step size 0.001 for the parameters r and 6. We estimated with better precision the 
location of the maximum of / + /i by evaluating the function at a fine grid in the region where 
the evaluations on the coarse grid give the largest values. The maximum found in this way is at 
a = 4.453, a = 0.62566, 6 = 0.03568 and r = 2.00134 {d = 0.19473). The value of / + /i at this 
point is -0.000058. 



4 Concluding remarks 

As mentioned in the introduction, all of the known rigorous upper bounds for the satisfiability 
threshold of 3-Sat are based on the first moment method [lOl O [131 El HI E]; the corresponding 
upper bounds in this sequence of results are: 4.758, 4.64, 4.601, 4.596, 4.571, 4.506. The general 
method is to consider a random variable Z that is equal to the number of satisfying assignments 
of a particular kind. These satisfying assignments are such that at least one exists if the formula is 
satisfiable. Showing that above a certain density E[Z] — > thus implies (by Markov's inequality) 
that Pr[Z = 0] ^ 1, and consequently the probability that the formula is unsatisfiable also goes 
to 1. For example, [5] takes Z to be the number of negatively prime solutions, i.e., solutions for 
which every variable assigned 1 is constrained. 

Here, we have used the same idea but with Z taken to be the total weight of partial assignments 
under the weight function ([1]) inspired by Survey Propagation. Given the dramatic success of Survey 
Propagation for random 3-Sat, it seems plausible that this approach can potentially yield rather 
tight upper bounds on the threshold. We were able to achieve only partial progress in this direction, 
but we are quite hopeful that extensions of our approach could lead to further progress. 
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0.02 



-0.02 



-0.06 




Figure 2: The values of sup,, /(a, a, r) and sup,. ;,(/(«, a, r) + /i(a, a, r, 0.4a + 0.7, b)) for a = 4.453, 
4.470, 4.490, and 4.506. 

One natural extension would be to use a different weight p for each variable, depending for 
example on the number of positive and negative occurrences of the variable. The corresponding 
generalization of Theorem [5] is proved in |2j. It is quite possible that the value of E[Z] in this case 
is significantly smaller. 
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